Creating Knowledge Repositories from Biomedical Reports: The MEDSYNDIKATE Text Mining System
Abstract
MEDSYNDIKATE is a natural language processor for automatically acquiring knowledge from medical finding reports. The content of these documents is transferred to formal representation structures which constitute a corresponding text knowledge base. The system architecture integrates requirements from the analysis of single sentences, as well as those of referentially linked sentences forming cohesive texts. The strong demands MEDSYNDIKATE places on the availability of expressive knowledge sources are met by two alternative approaches to (semi)automatic ontology engineering. We also present data on the knowledge extraction performance of MEDSYNDIKATE for three major syntactic patterns in medical documents.
Similar resources
MedSynDikate - a natural language system for the extraction of medical information from findings reports
MEDSYNDIKATE is a natural language processor, which automatically acquires medical information from findings reports. In the course of text analysis their content is transferred to conceptual representation structures, which constitute a corresponding text knowledge base. MEDSYNDIKATE is particularly adapted to deal properly with text structures, such as various forms of anaphoric reference re...
Chapter 3 Lexical, terminological and ontological resources for biological text mining
Biomedical terminologies and ontologies are frequently described as enabling resources in text mining systems [e.g., 1, 2, 3]. These resources are used to support tasks such as entity recognition (i.e., the identification of biomedical entities in text) and relation extraction (i.e., the identification of relationships among biomedical entities). Although a significant part of current text min...
Intelligent Approaches to Mining the Primary Research Literature: Techniques, Systems, and Examples
In this chapter, we describe how creating knowledge bases from the primary biomedical literature is formally equivalent to the process of performing a literature review or a ‘research synthesis’. We describe a principled approach to partitioning the research literature according to the different types of experiments performed by researchers and how knowledge engineering approaches must be caref...
Recent progress in automatically extracting information from the pharmacogenomic literature.
The biomedical literature holds our understanding of pharmacogenomics, but it is dispersed across many journals. In order to integrate our knowledge, connect important facts across publications and generate new hypotheses we must organize and encode the contents of the literature. By creating databases of structured pharmacogenomic knowledge, we can make the value of the literature much greater...
Lexical, Terminological, and Ontological Resources for Biological Text Mining
Biomedical terminologies and ontologies are frequently described as enabling resources in text mining systems [1–3]. These resources are used to support tasks such as entity recognition (i.e., the identification of biomedical entities in text), and relation extraction (i.e., the identification of relationships among biomedical entities). Although a significant part of current text mining effort...
Journal title:
- Pacific Symposium on Biocomputing
Publication date: 2002